Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports.
نویسندگان
چکیده
BACKGROUND Accumulating quantitative outcome parameters may contribute to constructing a healthcare organization in which outcomes of clinical procedures are reproducible and predictable. In imaging studies, measurements are the principal category of quantitative para meters. OBJECTIVES The purpose of this work is to develop and evaluate two natural language processing engines that extract finding and organ measurements from narrative radiology reports and to categorize extracted measurements by their "temporality". METHODS The measurement extraction engine is developed as a set of regular expressions. The engine was evaluated against a manually created ground truth. Automated categorization of measurement temporality is defined as a machine learning problem. A ground truth was manually developed based on a corpus of radiology reports. A maximum entropy model was created using features that characterize the measurement itself and its narrative context. The model was evaluated in a ten-fold cross validation protocol. RESULTS The measurement extraction engine has precision 0.994 and recall 0.991. Accuracy of the measurement classification engine is 0.960. CONCLUSIONS The work contributes to machine understanding of radiology reports and may find application in software applications that process medical data.
منابع مشابه
Extracting Imaging Observation Entities in Mammography Reports
Since radiology reports are created as unstructured text reports, Natural language processing (NLP) techniques are needed to extract structured information from reports to provide the inputs to information systems. The goal of this project is to develop NLP methods to extract the Imaging Observations and their modifiers from free-text mammography reports in order to provide structured data to r...
متن کاملA Machine Learning Approach for Identifying Anatomical Locations of Actionable Findings in Radiology Reports
Recognizing the anatomical location of actionable findings in radiology reports is an important part of the communication of critical test results between caregivers. One of the difficulties of identifying anatomical locations of actionable findings stems from the fact that anatomical locations are not always stated in a simple, easy to identify manner. Natural language processing techniques ar...
متن کاملTowards a Multilingual Financial Narrative Processing System
Large scale financial narrative processing for UK annual reports has only become possible in the last few years with our prior work on automatically understanding and extracting the structure of unstructured PDF glossy reports. This has levelled the playing field somewhat relative to US research where annual reports (10-K Forms) have a rigid structure imposed on them by legislation and are subm...
متن کاملClassification algorithms applied to narrative reports
Narrative text reports represent a significant source of clinical data. However, the information stored in these reports is inaccessible to many automated decision support systems. Data mining techniques can assist in extracting information from narrative data. Multiple classification methods, such as rule generation, decision trees, Bayesian classifiers, and information retrieval were used to ...
متن کاملClinician-Driven Automated Classification of Limb Fractures from Free-Text Radiology Reports
The aim of this research is to report initial experimental results and evaluation of a clinician-driven automated method that can address the issue of misdiagnosis from unstructured radiology reports. Timely diagnosis and reporting of patient symptoms in hospital emergency departments (ED) is a critical component of health services delivery. However, due to disperse information resources and va...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Applied clinical informatics
دوره 6 3 شماره
صفحات -
تاریخ انتشار 2015